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Abstract 

Financial institutions have to allocate so-called economic capital in 
order to guarantee solvency to their clients and counterparties. Math- 
ematically speaking, any methodology of allocating capital is a risk 
measure, i.e. a function mapping random variables to the real num- 
bers. Nowadays value-at-risk, which is defined as a fixed level quantilc 
of the random variable under consideration, is the most popular risk 
measure. Unfortunately, it fails to reward diversification, as it is not 
subadditive. 

In the search for a suitable alternative to value-at-risk, Expected 
Shortfall (or conditional value-at-risk or tail value-at-risk) has been 
characterized as the smallest coherent and law invariant risk measure 
to dominate value-at-risk. We discuss these and some other properties 
of Expected Shortfall as well as its generalization to a class of coherent 
risk measures which can incorporate higher moment effects. Moreover, 
we suggest a general method on how to attribute Expected Shortfall 
risk contributions to portfolio components. 

JEL classification D81, C13. 

Keywords Expected Shortfall, Value-at-Risk, Spectral Risk Measure, 
coherence, risk contribution. 



1 Introduction 



At the latest in 1999, when the article Artzner et alj (|1999 ) appeared, it 



became clear that value-at-risk (see Definition 2.1 below) cannot be consid- 
ered a sound methodology for allocating economic capital in financial insti- 
tutions. However, even if in Artzner et al. ( 1999| ) recommendations were 
given for the properties sound risk measures should satisfy, only recently 



'Deutsche Bundesbank, Postfach 10 06 02, 60006 Frankfurt a. M., Germany; e-mail: 
tasche@ma.tum.de 

The contents of this paper do not necessarily reflect opinions shared by Deutsche Bundes- 
bank. 



1 



Expected Shortfall (Definition 3.1 below) was suggested as practicable and 
sound alternative to value-at-risk. Nevertheless, there are still a lot of use- 
ful properties of Expected Shortfall and its generalizations which cannot be 
found in printed sources so far. 

With the paper at hand, we try to make up for this omission. We will 
recapitulate in section || what makes value-at-risk a seductive measure of 
risk and what are the main criticisms against it. In particular, we will see 
a new example (Example [2.4|) for its lacking subadditivity and give a new 
interpretation (Remark ^1]) why this is an important point. 

We will then introduce in section || Expected Shortfall as a convincing 
alternative to value-at-risk. We will summarize some of its more important 
properties. These properties are shared by all the representatives of the class 
of spectral risk measures that were introduced in Acerbi ( |2002[ ) (cf. Remark 
|3.7| below). Generalizing a result from Kusuoka ( 2001 ), we show that all 
the elements of this class can be represented as certain averages of values- 
at-risk at different levels (Theorem |3.6| ). This representation allows the 
easy creation of risk measures which enjoy the useful properties of Expected 
Shortfall and incorporate other desirable features like moment effects. 

When a risk measure for a portfolio has been chosen the question arises 
how to attribute risk contributions to subportfolios. This is of interest for 
a risk diagnostics of the portfolio (see Litterman , |1996| ) or for performance 
analysis. In section ||, we present a suggestion of how to do this in case 
of spectral risk measures (Definition ^4.1| and Proposition |4.2| ). Finally, we 
show for the Expected Shortfall that these contributions can be interpreted 
as conditional expectations given a worst case scenario (Proposition f4.7|). 



2 Value-at-Risk: lacking subadditivity 

Consider a random variable X which might be seen as the random profit 
and loss of an investment by a fixed time horizon. Positive values of X are 
regarded as profits, negative values as losses. The value-at-risk (VaR) of X 
at level a is the absolute value of the worst loss not to be exceeded with a 
probability of at least a. The following couple of definitions gives a formal 
description of this quantity. 

Definition 2.1 (Quantile, value-at-risk) Let a S (0,1] be fixed and X 
be a real random variable on a probability space (f2, J 7 , P). Define inf = oo. 
We then call 

q a {X) = inf{x G M : P[X < x] > a} (2.1a) 
the a-quantile of X. We call 

VaR Q (X) = q a (-X) (2.1b) 
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the value- at-risk (VaR) at (confidence) level a of X . 



Usually, values of a close to 1 are of interest. Since by definition P\X + 
VaR a (A) > 0] > a, VaR Q (A) can be interpreted as the minimal amount of 
capital to be put back by the investor in order to preserve her solvency with 
a probability of least a. 

Below, we will compare VaR to other methods for attributing capital 
to random variables (sometimes in insurance contexts also called risks). A 
positive capital attribution means that the risk under consideration requires 
capital whereas a negative capital attribution indicates that capital may be 
released. From an economical point of view, it makes sense to allow for risks 
which require a positively infinite amount of capital. A risk with capital 
requirement oo must not be accepted by the investor. The interpretation 
of a risk with capital requirement — oo is much less straightforward. Would 
this imply that such a risk can serve as a collateral for any risk with finite 
capital requirement? However, this case does not appear very likely and is 
therefore excluded from the following definition of risk measures. 

Definition 2.2 (Risk measure) Let (Cl,J-, P) be a probability space and 
V be a non-empty set of ^-measurable real-valued random variables. Then 
any mapping p : K^RU {00} is called a risk measure. 



VaR, as a risk measure in the sense of Definition |2.2| , enjoys most of the 
properties that are considered useful in the literature ( Artzner et al. , 199£ ; 
Kusuoka| , |200l|) . 



Proposition 2.3 (Properties of value-at-risk) Let a G (0, 1] be fixed 
and (D,, J-, P) be a probability space. Consider the risk measure p on the 
set V of all the T -measurable real-valued random variables which is given 

by 

p(X) = VaR a (A), X £ V. (2.2) 
Then p has the following properties: 

(1) Monotonicity: A, Y £ V, X < Y => p{X) > p(Y). 

(2) Positive homogeneity: X £ V,h > 0,hX £ V =>- p(hX) = hp(X). 

(3) Translation invariance: X £ V, aGM, X + a £ V p(X + a) = 
p(X) - a. 

(4) Law invariance: X, Y £ V, P[X < t] = P[Y < t] for allt£R 
^p(X)=p(Y). 

(5) Comonotonic additivity: f,g non- decreasing, Z real random variable 
on (f2, P) such that foZ,goZ£V 

=> p{f o Z + g o Z) = p(f o Z) + p(g o Z). 
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Proof. (1) until (4) are obvious. For (5), see e.g. Denneberg (1994). □ 



Note that VaR a is law invariant in a very strong sense: the distributions 
of X and Y need not be identical in order to imply VaR a (X) = VaR a (Y). 
A certain local identity of the distributions suffices for this implication. In 
particular, random variables X with light tail probabilities and Y with heavy 
tail probabilities (see e.g. Embrechts et al. , 1997| ) may have the same VaR a . 



This point is one main criticism against VaR as a risk measure. 

One important property is missing in the enumeration of Proposition 



2.3: the subadditivity, i.e. 

X,Y £V,X + Y £V => p(X + Y)<p{X)+p(Y). (2.3) 

It is well-known that VaR is not in general subadditive. Here we present a 
counterexample with continuous and even independent random variables. 

Example 2.4 (Lacking subadditivity of VaR) Let X\,X 2 Pareto dis- 
tributed with values in (— oo, 1) and independent. The joint distribution of 
(X, Y) is specified by 

PLYi < xi, X 2 < x 2 ] = (2-xi)- 1 (2-x 2 r\ xi,x 2 <l. (2.4a) 
This implies 

X&R a (Xi) = (1 - a)' 1 - 2, i = 1, 2, (2.4b) 

A — x (4 — x) A 

By flmi\) , we have VaR Q (Vx) + X&R a (X 2 ) < YaR a (X 1 + X 2 ) for all a £ 
(0, 1) because 

P[-(X 1 + X 2 )<2V a R a (X 1 )] = «-±^logi±f 

(2.4c) 

< a. 

In particular, for a = 0, 99 we have 

VaR Q (X 1 )= VaR a (X 2 ) = 98, VaR Q ( X x + X 2 ) « 203 , 2 . 

The lacking subadditivity of VaR is criticized because under certain circum- 
stances it might be an incentive to split up a large firm into two smaller firms. 
Another interpretation (Remark |2.6| ) follows from the following result. 

Proposition 2.5 Let X, Y be real, linearly independent random variables 
and p be a real-valued risk measure on the positive cone C spanned by X and 
Y , i.e. C = {mI+v Y : u, v > 0}. Assume that p is positively homogeneous 
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in the sense of Proposition j| (2) and that the function p(u,v) = p(uX + 
vY), u, v > is differentiable in (u,v). Then we have 



p(U 1 + U 2 ) < p{U x ) + p{U 2 ), U u U 2 eC, (2.5a) 

if and only if 

PuAUi + U 2 ) < p{U x ), pu^Ux + U 2 ) < p(U 2 ), U u U 2 e C, (2.5b) 
with 

ITT TT \ 9P I \ dP / \ 

PUi{U\ + U 2 ) = U iQ^\ U l + u 2,Vl + V 2 ) + Vi—\Ul + U 2 ,Vl + V 2 ) 



dv 



when Ui = ui X + ViY , i = 1,2 (note that by linear independence this 
representation is unique). 



Remark 2.6 By Euler's relation (see (\2. 6d\j ), the terms pu x {U\ + U 2 ) and 
Pu 2 (Ui + U 2 ) from ( 2. 5b ) sum up to p(Ui + U 2 ). Hence it appears quite 
natural to regard them as the risk (or capital) contributions of U± and U 2 
respectively to the total capital p{U\ + U 2 ) which is required by U\ + U 2 . 
Indeed, it can be argued that there is no other way to arrive at a reasonable 



notion of capital contribution than by partial derivatives (cf. Denauli , 2001 
\Tasch 

'tion 



19% ). Moreover, VaR and the risk measure ES to be defined below 
(Definition \3. 2\) satisfy the conditions of Propositi 



2.5 under quite general 



assumptions on the joint distribution of (X,Y) (cf. Tasche , 200C ). 

With this interpretation of pu^Ui +U 2 ), i = 1,2, the meaning of ( 2.5b ) 



is as follows: the manager who is responsible for subportfolio U\ will never 
be damaged by diversification in the portfolio of the firm because her capital 
contribution will never be greater than the capital requirement in the case of 
U\ considered as a stand-alone portfolio. 



Proof of Proposition 2.5. We show first that ( [2.5a ) implies pu^Ui+U^ < 
p(Ui). Fix Ui = Ui X + Vi Y, i = 1,2, and note that p{U\) = p(u\,v\) and 
p{U\ + U 2 ) = p(u\ + u 2 , v\ + v 2 ). Define the function / : (—1, oo) — > R by 



f(t) = p(m + u 2 ,vi + v 2 ) + tp(u 1 ,v 1 ) 

- p{{l + t)ui+u 2 ,{l + t)vi+v 2 ). 



Then 



dp 



f'{t) = p( Ul , Vl ) - U! ^ ((1 +t)ui + U 2 , (1 +t)v X + V 2 ) 

dp 



vi 



dv 



((l + t) Ul +u 2 ,(l+t)vi + v 2 ) 



(2.6a) 



(2.6b) 
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and in particular 



/(O) = 0, 

ell dP I \ dP I \ 

f (0) = p{ui,vi) -Ui—{ui +U 2 ,Vl + V 2 ) +Vi—{Ul +U 2 ,V! +v 2 ) 



du 

p(Ui)- P uAUi + u 2 ). 



(2.6c) 



( |2.5a| ) implies for t > that f(t) > 0. But, by fl2.6cj ), this is a contradiction 
to the assumption p{U x ) - PuA u i + U 2 ) = /'(0) < 0. This implies (|2.5b|) . 

Let us now consider the proof of the implication ( p. 5b ) =>■ ( |2.5a| ). This 
is easy since by Euler's relation and ( p. 5b ) 



p{Ui + U 2 ) = p(ui + u 2 ,vi + v 2 ) 

Op 



(ui + u 2 ) — [u\ +U 2 ,Vl + v 2 ) 



dp 



(2.6d) 



- (yi + v 2 ) — (ui +u 2 ,vi + v 2 ) 
< p(U x ) + P (U 2 ). 
This completes the proof of Proposition iLE. 



□ 



3 Spectral risk measures 

The weak points of VaR as a risk measure are well-known for some time 
(cf. [Artzner et ah , 1999| ). Nowadays, there is a certain consensus on the 
properties a reasonable risk measure should satisfy ( Artzner et al, , 1997] , 



199£; Delbaen, 1995; but see also Follmer and Schied, 2002, for a relaxation) : 



it should be coherent in the sense of the following definition. 



Definition 3.1 (Coherent risk measure) A risk measure p : V — > K U 
{oo} in the sense of Definition *2Jk is called coherent if it is monotonous, 
positively homogeneous, translation invariant, and subadditive (see Proposi- 
tion^ (1), (2), (3), and Eq. 



In order to preserve the desirable connection between the level of VaR Q and 
the investor's probability of solvency, it would be nice to have a smallest 
coherent risk measure to dominate VaR a . As was shown in Delbaen ( 1998| ) , 
such a smallest coherent major ant to VaR a does not exist. Nevertheless, 
m Pelbaenl (|1998|) was also shown there is a smallest coherent and law in- 
variant (see Proposition (4)) risk measureQthat dominates VaR a . The 



The term "law invariance" was introduced in |Kusuoka| ( p00ll ). A rough interpretation 
of law invariance might be "can be estimated from statistical observations only" . Anyway, 
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representation of this measure in Dclbacn ( p.998| ) was not explicit in the gen- 
eral case. However, it became clear that for continuous random variables X, 
it coincides with E[— X \ — X < VaR Q (X)] , the so-called tail value- at-risk. 
Note that tail value-at-risk, in general, is not subadditive (see e.g. Acerbi 
and Taschej $00% )- 

Denote - as usual - by \a = 1a (o) the indicator function of the set A, 
i.e. 1a(o) = if a ^ A and 1a{o) = 1 if a G A. 

Definition 3.2 Let a £ (0, 1) be fixed and X be a real random variable on 
a probability space (Q,J-,P) with E[max(0, — X)] < oo. Define q a (—X) as 
in Definition [<?. j| . We then call 

ES a (X) = -(1 - a)- 1 (E[X 1 { - X > qa (-X)}] 

+ q a (-X)[a-P[-X <q a (-X)]^ (3.1) 
Expected Shortfall (ES) at level a of X. 



It turned out flKusuokaj , [200l| ; [Acerbi and Taschd, , p002| ) that ES from Defi- 
nition 3^ is just the smallest coherent and law invariant majorant of VaR Q 
which had been already mentioned in Delbaen ( 1998| ). The term ES stems 
from Acerbi et al. ( 2001 ) where a further proof of the coherence of ES was 
given. Independently, ES was introduced in Rockafellar and Uryasev (2001) 
under the notion Conditional value-at-risk (CVaR). The properties of ES are 
discussed in detail in Acerbi and Tasche ( |2002|) and |Rockafellar and Uryasev 
(|2001[) . 

The following result (Acerbi and Tasche, 2002j ; Pflug, 2000) is important 
for the calculation of VaR and ES, and, by the way, enlightens the rela- 
tionship between the notion of ES and the quantile regression which was 



introduced in Koenker and Bassett ( 1978 ). ES is just the optimal value in 
an optimization problem where —VaR is the optimizing argument. 



Proposition 3.3 For ES a as given in Definition $\ and q a , qi— a as given 
in Definition |g. jL we have 



ES a {X) =min-(l-a)~ 1 (E[Xl { „ x > s} ] +s\a-P[-X < s)\) (3.2a) 



and 



[q a (-X),- qi ^ a (X)] = argmin-(l - a) 1 (e[X1 { _ x>s} 



+ s{a-P[-X < s]}), (3.2b) 



as VaR is law invariant it seems natural to look for its smallest coherent and law invariant 



majorant. See eq. (1) in Acerbi ( 20021 ) for an example of a risk measure which is not law 
invariant in sense of Proposition 2.3| (4) . 
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whenever X is a real random variable with E[max(0, — X)] < oo. 



Proof. Proposition 4.2 in Acerbi and Tasche ( |2002j ). □ 

Note that the interval in ( |3.2b| ) is never empty and that VaR a (X) = 
q a {—X) by definition. Let us now have a look on another useful represen- 
tation of ES. 



Proposition 3.4 For ES a as given in Definition \3. 1} and VaR a as given in 
Definition 2.1, we have 

ES Q (X) = (1 - a)" 1 I VaR u (X) du, (3.3) 

whenever X is a real random variable with E[max(0, — X)] < oo. 
Proof. Proposition 3.2 in Acerbi and Tasche] ( |2002j ). 



□ 



In combination with Proposition 2.3, Proposition implies that ES is 
a law invariant and comonotonic additive risk measure. The comonotonic 
additivity of a risk measure becomes particularly interesting when it occurs 
at the same time as subadditivity. 

Remark 3.5 Fix a £ (0, 1) and consider integrable random variables X and 
Y. Assume that we do not know the joint distribution of X and Y . Then, 
from subadditivity, we see that ES Q (X) + ES a (y) is an upper bound for the 
risk of X + Y when risk is measured by ES. By comonotonic additivity, we 
know additionally that this upper bound is sharp in the sense that it occurs 
in the case of comonotonic X and Y (i.e. X = f o Z and Y = g o Z for 
some random variable Z and non- decreasing functions f and g). 

Compare this to the situation when VaR a is used as risk measure. Then 
there is no easy general upper bound for the risk of X + Y , and finding the 
joint distribution of X andY which yields the maximum value for ■ VaR Q (X+ 
Y) is a non-trivial task (Embrechts et ai, 2001; Luciano and Marena, 200 \ ). 



Note that there are coherent and law invariant risk measures which are not 



comonotonic additive (e.g. the standard semi-deviation, see Fischer , 2001 ). 

It might have become clear from the above considerations that the class 
of coherent, law invariant and comonotonic additive risk measures is of par- 
ticular interest. In Kusuoka ( 2001] ), a complete characterization of this class 
was accomplished, under the additional assumption that the risk measures 
under consideration satisfy the so-called Fatou property. We show that this 
assumption is dispensable. 

Theorem 3.6 Let p be a risk measure on the space V of the bounded random 
variables in the probability space (fl,J-, P). Assume that (O, T , P) is stan- 
dard and non-atomic (i.e. there exists a random variable which is uniformly 
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distributed on (0, 1) ). Then p is a coherent, law invariant and comonotonic 
additive (see Definition i| and Proposition \2. <j| (4), (5)) risk measure if and 
only if 



p(X) = p I VaR u (X) F(du) + (1 -p) VaRi(X), X € V, 
Jo 



(3.4) 



where p £ [0, 1] and F is a continuous convex distribution function which is 
concentrated on [0, 1]. 

Remark 3.7 

(i) Choose p = 1 and F(u) = max(0, j5^) *n order to obtain ES a from 

(ii) Note that any continuous and convex distribution function F on [0, 1] 
is absolutely continuous, i.e. can be written as F{u) = L f(t)dt where 
f is its density with respect to Lebesgue measure. Thus Theorem 3A. 
states that the class of spectral risk measures which was introduced in 



Acerb; ( 200i ) is just the class of coherent, law invariant and comono- 



tonic additive risk measures. 



(Hi) Formulas like (3.4) can be traced back a long time in the actuarial 



literature (cf. Wang , 1996 , and the references therein). 



Proof of Theorem |3.6| . Let us first regard the case where a risk measure 
p as in (3.4) is given. Law invariance of p is then clear since ( |3.4| ) is based 
on quantiles of X. If p = then p is just the essential supremum of X. It 
is then obvious that p is coherent and comonotonic additive. 

Assume now p > 0. Construct a function Fq : [0, 1] — ► [0, 1] by setting 



F (u) 



pF(u), 0<u<l 
1, u = l. 



(3.5) 



Observe that Fq is again convex and non-decreasing but may fail to be 
continuous in 1. Nevertheless, it is easy to show that ( |3.4| ) is equivalent to 



I 



XdF oP, XeV, 



(3.6) 



where J X dFo o P denotes the non-additive integral with respect to the 
distorted probability FqoP in the sense of Dennebe"rg| ( |1994 ). Coherence 
and comonotonic additivity of p are now just conclusions from the general 
theory of non-additive integration. 

Next, we show that any coherent, law invariant and comonotonic additive 



risk measure p can be represented as in (|3.4j ). As a first step, we conclude 
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from the results in Schmeidlei (1986, Schmeidler's theorem) that p can be 
written as 



p{X) = - J Xdv, X£V, (3.7) 
where j X dv denotes again a non-additive integral in the sense of |Denj 



nebergj ( |1994| ). v is a monotonous (i.e. A, B G J 7 , A C B => v(A) < v{B)) 



and super-modular (i.e. A, B G T => z/(A) + z^(B) < i/(A U 5) + z^(^l n B)) 
set function (i.e. z/(0) = 0) on (£l,J~) with i/(Q) = 1. We define a function 
F : [0, 1] -> [0, 1] by 

F (u) = (3.8a) 

for any igf with P[A] = u. Since (O, P) is standard and non-atomic, 
for every u G [0, 1] there is at least one A G T with PL4] = u. The law in- 
variance of p implies that Fq is with ( |3.8a| ) is well-defined. The monotonicity 
of v implies that Fq is non-decreasing. Moreover, from (|3.8a|) also follows 



u = F oP. (3.8b) 

Again, since (O, J 7 , P) is standard and non-atomic, for any ui, 112, U3, 114 G 
[0, 1] with < U\ < U4 < 1, 11.2,113 G [«i,U4] and 112 — Ui = U4 — 1*3 there are 
events A,B £ J 7 such that we have 

P[A] = u 3 , P[B] = n 2 , P[^ n 5] = ui, P[A UB] = U4- (3.9) 

The super-modularity of v and (|3.9|) imply 

FoK) + F («i) > F (u 2 ) + F (u 3 ) (3.10) 



for all ui,U2,U3,«4 as above. With ui = U3, (13.10 ) yields for any < u < 
v < 1 that 

^(-5-) < -F (u) + -F (v). (3.11a) 

Of course, from ( |3.11a| ) we obtain 

F (au + (l- a)v) <aF (u) + (1 - a)F (u) (3.11b) 

for every a G : n > 1, A; = 0, l,...,2 n }. Since Fo is non-decreasing, 
limits from the right exist in au + (1 — a)v for every a G (0,1). Hence, 
by passing to the limits in ( |3.11b| ) we can conclude that (3.11b) holds for 



every a G [0, 1], i.e. Fq is convex. Observe that a function Fq : [0, 1] — * [0, 1] 
with Fq(0) = and Fq(1) = 1 is necessarily continuous on [0,1) if it is 
non-decreasing and convex. Furthermore, Fq can be constant at most on an 
interval [0, e). On [e, 1] it will then be strictly increasing. 

So far, we know that p can be represented by ([O]) where Fq : [0, 1] — > 
[0, 1] is non-decreasing and convex as described above. Now, applying the 
definition and some other properties of non-additive integrals yields repre- 
sentation (fOh where p is given by p = sup u6 r 0jl ) Fq(u) and F and Fq are 
related by (^). □ 
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Remark 3.8 (Incorporating moment effects in ES) 

Representation \3.6J allows in some cases a helpful interpretation of what 
happens when p(X) is calculated. Fix any positive integer n. Recall from 



Remark \3. r \ (i) the function F which generates ES when used in ( \3.4) - Define 



M n (u) = F(l - 



u € [0,1]. 



(3.12) 



If X is a real random variable, interpreted as the profit and loss of a finan- 
cial asset, consider independent and identically distributed Y,Yi, . . . ,Y n with 
P[-Y < t] = 1 - yP[-X > t}. Then 

P[-X < t] =P[min(-Yi,... : 
F(P[Y>t}) =M n (P[X>t]), 



-Y n ) < t 



(3.13) 



and hence 



J YdFoP = J 



XdM r , O P. 



(3.14) 



By Theorem^, hence ES ( a\x) = ES a (Y) is a spectral risk measure. Note 
that (cf. \Delbaerj , 



E[max(0, —Y)] < oo 
and for any e > 



E[(max(0,-X)) n ] < oo (3.15a) 



E[(max(0, -X)) n+e ] < oo => E[max(0, —Y)] < oo. 



(3.15b) 



( 3.15a ) and ( 3.15b ) show that ESi™^(X) is sensible to the n-th moment of 
X. By (3.13) it may be interpreted as the Expected Shortfall of a random 
variable Y which is generated from X by a pessimistic manipulation since the 
loss variable —X has the same distribution as the minimum ofn independent 
copies of the loss —Y. 



4 Risk contributions 

In this section we study the following problem: Given random variables 
X\, . . . , Xd (e.g. profits and losses of the different business lines in a financial 
institution), portfolio weights u±, . . . , u^, and a risk measure p, we want to 
know how much U{Xi contributes to the total risk p(Yli=i u i Xi) of the 
portfolio. With u = (u±, . . . , u<i) write for short 

d 

p(u) = p{Y^u t X t ). (4.1) 

8=1 
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Denaultl ( pOOl| ) and [Taschej ( |l99e| ) (with different reasonings) argued that 



Ui -^-(u) is the appropriate definition for the risk contribution of Ui X- t in 
the case when p(u) is partially differentiable with respect to the components 
of u. 

The question of how to compute the partial derivatives in case p = 
VaR a was independently tackled by several authors flGouricroux et ah , 200C ; 



Hallerbach, 1999|; Lemus|, 1999 


; Tasche 




1999) 



certain smoothness assumptions on the joint distribution of (X\, . . . , X^) 
d 



<9VaR Q 
dm 



d d 

u j x j) = - E Xi | u o x 3 = VaR ° E u o x j) ■ ( 4 - 2a ) 
j=i j=i j=i 

Note that in case of integrable components X±, . . . , the right-hand side of 
( 1.2a| ) will always be defined. By the factorization lemma there are functions 
fa = cpi(u; z), i = 1, . . . , d, such that almost surely 



E 



Xi 



u i X i 



fa(u;z). 



(4.2b) 



By inspection of (3.4), (4.2a) and ( (4.2b| ) suggest the following definition for 
risk contributions in case of a spectral risk measure in the sense of Remark 
(ii). 



Definition 4.1 (Risk contributions of spectral risk measures) 

Let X±, . . . , Xj be integrable random variables on the same probability space, 
u = (ui, . . . , Ud) their portfolio weight vector, and p be a spectral risk measure 
with representation Define functions fa = fa(u\z), i = l,...,d, by 

ft4-%b\ ) an d write VaR a (u) for VaR Q (^^ =1 Uj Xj} . Then, if all the involved 
integrals exist, the quantity 

Piiu) = -p [ fa(u;-V&R a (u)) F(da) - (1 - p) fa(u; -VaRi(u)) (4.3) 
Jo 

is called marginal impact of Xi on the total risk p{u) = p(5Z~ =1 Uj Xj} . The 
quantity n, pi(u) is called risk contribution of UiXi to the total risk. 



By the standard theory of non-additive integration (see Denneberg, 1994| ) 
we obtain the following equivalent representation of the risk contributions 



from Definition 4.1. 



Proposition 4.2 Assume that the random variables X\, . . . , X^ from Defi- 
nition^^ are defined on the probability space (£l,J-,P). Then the marginal 
impacts Pi(u) in l\4-$) can be equivalently written as 



fa (u; 



d 

£ 

3=1 



U 



Xj) dF o P, 



(4.4) 
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where Fq is given by j\3.5] ). As a consequence, for fixed u, the value of Pi(u) 
does not depend on the choice of <f>i(u; •). 



Proof, flop <^=^ (4.4) can be proved like in the proof of Theorem 3.£. 
Denote ^2j=±Uj Xj by Z(u), and let 4>i(u\z) and <j>*(u;z) be two versions 
of E[Xj|Z(u) = z\. Then, 4>i(u; z) and <p*(u;z) are equal for all z but 
those in a set N with P[Z(u) G IV =0. Of course, then we also have 
(F o P) [Z(u) G N) = 0. This implies 

J Z(u)) dF oP = J Z(n)) dF o P. 

Thus, the proof is accomplished. □ 



Remark 4.3 ( \4-4 ) suggests the following procedure for the estimation of the 
marginal impacts Pi(u) on the spectral risk measure p(u) = p(X^/=i u j Xj) '■ 

1. Estimate the conditional expectations <pi{u; •) (see ( \4-2l\ ); could be done 
by a kernel estimation). 

2. Estimate the distribution of Y2j=i u j -^j ( cou ld be done by a kernel 
estimation of the density). 

3. Resample from the distribution of Ylj=i u j -^j distorted by Fq, apply 
4>i(u;-) on the sample, and estimate pi{u) with the ordinary sample 
mean. 



The representations Q4.3j) and (4.4) of the marginal impacts on spectral risk 



measures can be significantly simplified in case of the Expected Shortfall 
ES a (Definition |3.2[ ). To see this we need the following two results. 



Proposition 4.4 Let X be a real random variable, f : R — ► [0, oo) a func- 
tion such that E[max(0, — /oX)] < oo and let a G (0, 1) be a fixed confidence 
level. Then 

^ f{q u (X))du = E[/oXl {x < 9a(x)} ] 
Jo 

+ f{q a (X))(a-P[X<q a (X)}). (4.5) 

Proof. By switching to another probability space if necessary, we can as- 
sume that there is a real random variable U that is uniformly distributed on 
(0, 1), i.e. P[J7 < u] = u, u £ (0, 1). It is well-known that then the random 
variable Z = qjj(X) has the same distribution as X. 

Since u t— > q u (X) is non-decreasing we have 

{U < a} C {Z < q a (X)} and 
{U > a} n {Z < q a {X)} C{Z = q a (X)} . 
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By ( |4.6p we obtain 

[ a f(q u (X))du = E[foZl {u < a} ] 
Jo 

= E[f O Z l{Z<q a (X)}] ~B[foZ l{U>a}n{Z<q a (X)}] 

= E[foX l { x< qa (x)}} + q a (X) (a - PLY < q a (X)}) . 

(4.7) 

Thus, the proof is accomplished. □ 
Remark 4.5 Prop. \/^.J\ generalizes Prop. 3.2 of \Acerbi and Tasche ( 200'\ ). 



The "< " in ( ^5 ) may be replaced by "< 



Corollary 4.6 Let X,Y be real random variables such that E[|Y|] < oo 
and let a S (0, 1) be a fixed confidence level. Then 



I 



1 E [Y | X = -VaR M pO] du = E[Y l { -x> 9a (-x)}] 

+ E[Y | - X = q a (-X)} (P [-X < q a (-X)} - a) . (4.8) 



Moreover, the value of f^~E[Y\X = — VaR u (X)] du is the same for any 
version of the conditional expectation. 

Proof. Non-dependence on the particular version of conditional expectation 



follows from Proposition 4.4. Observe that 



I E[Y \X = -VaRupO] du = [ E[Y \ - X = q u {-X)] du 

Ja Jo 

- [ a E[Y\ -X = q u {-X)]du 
Jo 

= E[Y] - E[Y | - X = q u (-X)] du. 
Jo 

(4.9a) 

Proposition L4 and Remark applied to f(x) = E[y | — X = x] , yield 

[ a E[Y\-X = q u (-X)] du = E[Y l { _ x<ga (-x)}] 
Jo 

+ E [Y | - X = q a (-X)] (a-P [-X < q a (-X)} ) . (4.9b) 



( 4.9a| ) and ( [4.9b[ ) imply the assertion. □ 

Recall Definition ^4] of the marginal impact pi (u) of a component Xi on 
the total risk p{u) of a portfolio Ylj=i u j Xj when p is a spectral risk measure. 
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This definition applies to p = ES a with p = 1 and F(u) = max(0, fr^)- I n 
this case, Corollary [4.6| implies the representation (with Z(u) = Y^,j=i u j Xj) 

Pi{u) = -(1 - a)" 1 ^E[Xi l{-z(u)> qa (-z{u))}] 

+ E[X t \ -Z(u) = q a (-Z(u))](p[-Z(u)<q a (-Z(u))]-aj\. (4.10) 

Returning to ( f4.8|) , we will show that its right-hand side times (1— a) -1 (and, 
as a consequence, also Pi(u) from( f4"Tl~0| )) can be interpreted as a conditional 
expectation given that a certain worst case event has occurred. Note first 
that by the very definition of quantile we have 

< P [-X < q a (~X)] - a < P [-X = q a (-X)] (4.11a) 

and in particular 

P [-X < q a (-X)] - a + P [-X = q a (-X)] > 0. (4.11b) 

Hence, it makes sense to define a {0, l}-valued random variable J = Jxa 
with 

P[J = 1] = p a = 1-P[J = 0], (4.12a) 

where 

[0, otherwise. 



Proposition 4.7 Lei X, 1" 6e real random variables such that E[|y|] < oo 
and a £ (0, 1) a fixed confidence level. Suppose that there is a random 
variable J which satisfies ( 4.12a ), ( 4-12f\) and is independent from (X, Y). 
Define 

(4.13a) 
u. (4.13b) 



, -{-X>q a (-X)}U{-X=q a (-X),J=l}- 



Then 



E[Y 



1 = 1} = (i-a)- 1 f E[F|X = -VaR u (X)] d 

J a 



Proof. By ( |4.12b| ) and the independence of I and (X, Y) we have 

P[J = 1] = 1 - a. (4.14) 
It is now straightforward to see that 

E[yi {/=1} ]=E[ri { _ x > 9a( _ x)} ] 

+ E[Y\-X= q a (-X)] (P[-X < q a (-X)} - a) . (4.15) 
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Thus, the assertion follows from Corollary ^1]. □ 

The philosophy behind value-at-risk (VaR) is that the event {—X < 
q a (—X)} is tolerable whereas {—X > q a (— X)} corresponds to a kind of 
default. Note that 

P[-X>q a (-X)] < 1-a. (4.16) 

Hence one might consider / from Proposition [O] an indicator of { — X > 
q a (— X)} modified in a way that enlarges the probability of default. Setting 
Y = X in ( p~13bl) shows that ES itself may be regarded as a conditional 



expectation in a worst case scenario. Replacing X by Z(u) and Y by Xi 
shows that the same holds for the ES marginal impacts from (|4.10| ). 

Observe that ( |4.13b| ) is also a statement about how to estimate ES and 
the ES marginal impacts. Assume that an independent, identically dis- 
tributed sample {X\^ . . . , X^i), i = 1, . . . , N, of the portfolio component 
returns is given (cf. ( ]4. 10[ ) ) . Let Zj = Ylj=i u j i = 1, . . . ,N. 

• First estimate q a (— Z) from (Zi, . . . , Zjv) by some number q. 

• Estimate the probabilities P[— Z < q a (— Z)l and P[— Z = q , Q (— Z)]. 
Let p s and p e denote the corresponding estimators. 

• Determine a sub-sample by taking all those i such that — Zj > g or 
— Zj = g and an additional independent Bernoulli experiment with 
success probability Ps ~ a (only in case p e > 0) results in 1. 



Estimate ES Q (Z) and the marginal impacts according to ( [4.10 ) as 
negative averages of this sub-sample. 
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